99 research outputs found

    On Multilingual Training of Neural Dependency Parsers

    We show that a recently proposed neural dependency parser can be improved by joint training on multiple languages from the same family. The parser is implemented as a deep neural network whose only input is orthographic representations of words. In order to parse successfully, the network has to discover how linguistically relevant concepts can be inferred from word spellings. We analyze the representations of characters and words learned by the network to establish which properties of the languages were accounted for. In particular, we show that the parser has approximately learned to associate Latin characters with their Cyrillic counterparts and that it can group Polish and Russian words that have a similar grammatical function. Finally, we evaluate the parser on selected languages from the Universal Dependencies dataset and show that it is competitive with other recently proposed state-of-the-art methods, while having a simple structure.
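
    A minimal sketch, assuming PyTorch, of the general idea behind orthographic word representations: each word is encoded from its character sequence, so the model can relate words across scripts (e.g. Latin and Cyrillic) through the learned character embeddings. The module name and hyperparameters below are illustrative assumptions, not the authors' architecture.

        # Illustrative character-level word encoder (a sketch, not the paper's code).
        import torch
        import torch.nn as nn

        class CharWordEncoder(nn.Module):
            def __init__(self, n_chars, char_dim=32, word_dim=64):
                super().__init__()
                self.char_emb = nn.Embedding(n_chars, char_dim, padding_idx=0)
                self.rnn = nn.LSTM(char_dim, word_dim // 2,
                                   bidirectional=True, batch_first=True)

            def forward(self, char_ids):                 # char_ids: (batch, word_len)
                x = self.char_emb(char_ids)              # (batch, word_len, char_dim)
                _, (h, _) = self.rnn(x)                  # h: (2, batch, word_dim // 2)
                return torch.cat([h[0], h[1]], dim=-1)   # (batch, word_dim)

        # Toy usage: encode two padded character-id sequences into word vectors.
        encoder = CharWordEncoder(n_chars=100)
        words = torch.tensor([[5, 12, 7, 0], [9, 3, 0, 0]])
        print(encoder(words).shape)                      # torch.Size([2, 64])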

    A derivational model of discontinuous parsing

    The notion of latent-variable probabilistic context-free derivation of syntactic structures is enhanced to allow heads and unrestricted discontinuities. The chosen formalization covers both constituent parsing and dependency parsing. The derivational model is accompanied by an equivalent probabilistic automaton model. The new framework yields a probability distribution over the space of all discontinuous parses, which lends itself to intrinsic evaluation in terms of perplexity, as shown in experiments.
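
    As a point of reference, perplexity-based intrinsic evaluation generally amounts to exponentiating the average negative log-probability the model assigns to held-out data. The sketch below shows that generic computation with made-up numbers; it is not the paper's evaluation code.

        # Generic corpus perplexity from model log-probabilities (a sketch; the
        # numbers and token count are invented for illustration).
        import math

        def perplexity(log_probs, n_tokens):
            """log_probs: natural-log probabilities of held-out sentences under the
            model; n_tokens: total number of tokens in those sentences."""
            return math.exp(-sum(log_probs) / n_tokens)

        print(perplexity([-12.3, -8.7, -15.1], n_tokens=20))  # ~6.08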

    Taking SPARQL 1.1 extensions into account in the SWIP system

    The SWIP system aims at hiding the complexity of expressing a query in a graph query language such as SPARQL. We propose a mechanism by which a query expressed in natural language is translated into a SPARQL query. Our system analyses the sentence in order to identify concepts, instances, and relations. It then generates a query in an internal format called the pivot language. Finally, it selects pre-written query patterns and instantiates them with the keywords of the initial query. The candidate queries are presented as explanatory natural language sentences, among which the user selects the one he/she is actually interested in. We are currently focusing on new kinds of queries handled by the new version of our system, which is now based on version 1.1 of SPARQL.
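
    A hedged sketch of the pattern-instantiation step the abstract describes: keywords produced by the pivot-language analysis are slotted into a pre-written SPARQL 1.1 pattern. The prefix, property name, and pattern below are invented for illustration and are not taken from SWIP.

        # Toy pattern instantiation (illustrative only; not the SWIP implementation).
        PATTERN = """\
        PREFIX ex: <http://example.org/ontology#>
        SELECT ?author (COUNT(?work) AS ?nWorks)   # aggregates are a SPARQL 1.1 feature
        WHERE {{
          ?work ex:{relation} ?author .
        }}
        GROUP BY ?author
        """

        def instantiate(pivot_query):
            """pivot_query: keyword/relation structure produced by the analysis step."""
            return PATTERN.format(relation=pivot_query["relation"])

        print(instantiate({"relation": "writtenBy"}))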

    Integrating isotopes and documentary evidence : dietary patterns in a late medieval and early modern mining community, Sweden

    We would like to thank the Archaeological Research Laboratory, Stockholm University, Sweden and the Tandem Laboratory (Ångström Laboratory), Uppsala University, Sweden, for undertaking the analyses of stable nitrogen and carbon isotopes in both human and animal collagen samples. Also, thanks to Elin Ahlin Sundman for providing the δ13C and δ15N values for animal references from Västerås. This research (Bäckström’s PhD employment at Lund University, Sweden) was supported by the Berit Wallenberg Foundation (BWS 2010.0176) and Jakob and Johan Söderberg’s foundation. The ‘Sala project’ (excavations and analyses) has been funded by Riksens Clenodium, Jernkontoret, Birgit and Gad Rausing’s Foundation, SAU’s Research Foundation, the Royal Physiographic Society of Lund, Berit Wallenbergs Foundation, Åke Wibergs Foundation, Lars Hiertas Memory, Helge Ax:son Johnson’s Foundation and The Royal Swedish Academy of Sciences.

    Splitting Arabic Texts into Elementary Discourse Units

    In this article, we present the first work investigating the feasibility of Arabic discourse segmentation into elementary discourse units within the Segmented Discourse Representation Theory framework. We first describe our annotation scheme, which defines a set of principles to guide the segmentation process. Two corpora have been annotated according to this scheme: elementary school textbooks and newspaper documents extracted from the syntactically annotated Arabic Treebank. We then propose a multiclass supervised learning approach that predicts nested units. Our approach uses a combination of punctuation, morphological, lexical, and shallow syntactic features, and we investigate how each feature contributes to the learning process. We show that an extensive morphological analysis is crucial to achieving good results on both corpora. In addition, we show that adding chunk features does not boost the performance of our system.
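
    A minimal sketch, assuming scikit-learn, of the kind of feature-based multiclass setup the abstract describes: one feature dictionary per token (punctuation, morphological, lexical, and shallow syntactic cues) and a classifier that predicts boundary labels, including labels for nested units. Feature names, labels, and the tiny data set are invented for illustration; this is not the authors' system.

        # Illustrative multiclass boundary classifier over hand-crafted token features.
        from sklearn.feature_extraction import DictVectorizer
        from sklearn.linear_model import LogisticRegression
        from sklearn.pipeline import make_pipeline

        # One feature dict per token; the label encodes its (possibly nested) boundary status.
        X = [
            {"token": "w", "pos": "CONJ", "is_punct": False, "prev_pos": "VERB"},
            {"token": ",", "pos": "PUNCT", "is_punct": True, "prev_pos": "NOUN"},
            {"token": "qAl", "pos": "VERB", "is_punct": False, "prev_pos": "PUNCT"},
        ]
        y = ["BEGIN", "INSIDE", "BEGIN-NESTED"]

        model = make_pipeline(DictVectorizer(), LogisticRegression(max_iter=1000))
        model.fit(X, y)
        print(model.predict([{"token": "vm", "pos": "CONJ",
                              "is_punct": False, "prev_pos": "VERB"}]))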
